Discovering Evolutionary Theme Patterns from Text ∗ CS 598
نویسندگان
چکیده
Temporal Text Mining (TTM) is concerned with discovering temporal patterns in text information collected over time. Since most text information bears some time stamps, TTM has many applications in multiple domains, such as summarizing events in news articles and revealing research trends in scientific literature. In this paper, we study a particular TTM task – discovering and summarizing the evolutionary patterns of themes in a text stream. The evolutionary patterns of a theme are divided into content evolution and strength evolution. We define the problem of discovering evolutionary theme patterns on both aspects and present general probabilistic methods for (1) discovering latent themes from text; (2) constructing an evolution graph of themes; and (3) analyzing life cycles of themes. Evaluation of the proposed methods on three different data collections (i.e., one news articles collection and two literature collections) shows that the proposed methods can discover interesting evolutionary theme patterns effectively.
منابع مشابه
Discovering Unknown Patterns in Free Text
Copyright © 2006, Idea Group Inc., distributing in print or electronic forms without written permission of IGI is prohibited. INTRODUCTION A very large percentage of business and academic data is stored in textual format. With the exception of metadata, such as author, date, title and publisher, these data are not overtly structured like the standard, mainly numerical, data in relational databa...
متن کاملThematic Progression Patterns in the English News and the Persian Translation
Thematic progression pattern as the method of development of the text insures that the reader follows the right path in understanding the text; in this regard, this subject is attracting considerable interest among discourse analysts. This paper calls into question the status of thematic progression in the process of translating English news into Persian. With this in mind, we analyzed the them...
متن کاملThematic Progression in the Rhetorical Sections of an Online Iraqi English Newspaper
Abstract Thematic development refers to the way theme and rheme in the clause are developed. The theory of rhetorical structure can be defined as the strategies that follow specific ways to make writing more persuasive. The present study aimed to examine how Iraqi writers maintain cohesion in the text by analyzing the patterns of thematic progression in various rhetorical sections in an online ...
متن کاملThematic Progression in the Rhetorical Sections of an Online Iraqi English Newspaper
Abstract Thematic development refers to the way theme and rheme in the clause are developed. The theory of rhetorical structure can be defined as the strategies that follow specific ways to make writing more persuasive. The present study aimed to examine how Iraqi writers maintain cohesion in the text by analyzing the patterns of thematic progression in various rhetorical sections in an online ...
متن کاملEfficient Theme and Non-Trivial Repeating Pattern Discovering in Music Databases
In this paper, we propose an approach for fast discovering all non-trivial repeating patterns in music objects. A repeating pattern is a sequence of notes which appears more than once in a music object. The longest repeating patterns in music objects are typically their themes. The themes and other non-trivial repeating patterns are important music features which can be used for both content-ba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005